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Abstract 

In this paper, we consider a two-way relay network where two sources can communicate only 
through an untrusted intermediate relay, and investigate the physical layer security issue of this two- 
way relay scenario. Specifically, we treat the intermediate relay as an eavesdropper from which the 
information transmitted by the sources needs to be kept secret, despite the fact that its cooperation in 
relaying this information is essential. We indicate that a non-zero secrecy rate is indeed achievable in 
this two-way relay network even without external friendly jammers. As for the system with friendly 
jammers, after further analysis, we can obtain that the secrecy rate of the sources can be effectively 
improved by utilizing proper jamming power from the friendly jammers. Then, we formulate a 
Stackelberg game model between the sources and the friendly jammers as a power control scheme 
to achieve the optimized secrecy rate of the sources, in which the sources are treated as the sole 
buyer and the friendly jammers are the sellers. In addition, the optimal solutions of the jamming 
power and the asking prices are given and a distributed updating algorithm to obtain the Stakelberg 
equilibrium is provided for the proposed game. Finally, the simulations results verify the properties 
and the efficiency of the proposed Stackelberg game based scheme. 
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I. Introduction 

Traditionally security in wireless networks has been mainly considered at higher layers 
using cryptographic methods. However, recent advances in wireless decentralized and ad-hoc 
networking have led to an increasing attention on studying physical layer based security. The 
basic idea of physical layer security is to exploit the physical characteristics of the wireless 
channel to provide secure communication. The security is quantified by the secrecy capacity, 
which is defined as the maximum rate of reliable information sent from the source to the 
intended destination in the presence of eavesdroppers. This line of work was pioneered by 
Aaron Wyner, who introduced the wiretap channel and established fundamental results of 
creating perfectly secure communications without relying on private keys H). Wyner showed 
that when the eavesdropper channel is a degraded version of the main channel, the source 
and the destination can exchange perfectly secure messages at a non-zero rate. In follow-up 
work [2J, the secrecy capacity of Gaussian wiretap channel was studied, and in 01 Wyner's 
approach was extended to the transmission of confidential messages over broadcast channels. 
Recently, researches on physical layer security have generalized these studies to wireless 
fading channels flU-Q, MIMO channels |[8l- lfT2l . and various multiple access scenarios 

ITT3TI— ITT8TI. 

Motivated the fact that if the source-wiretapper channel is less noisy than the source- 
destination channel, the perfect secrecy capacity will be zero [3], some recent work has 
been proposed to overcome this limitation using relay cooperation, which mainly consists of 
cooperative relaying [[191 . Il20l . and cooperative jamming ETTl . Il2~2~ll . For instance, in lfl9ll and 
[20 1, the authors proposed effective decode-and-forward (DF) and amplify-and-forward (AF) 
based cooperative relaying protocols for physical layer security, respectively. Cooperative 
jamming is another approach to improve the secrecy capacity by distracting the eavesdropper 
with codewords independent of the source messages. In [1211 and [|22l . several cooperative 
jamming schemes were investigated for different scenarios, where classical relay strategies 
fail to offer positive performance gains. Relay channel with confidential messages was studied 
in 11231 and 11241 . where the relay node acts both as an eavesdropper and a helper. In If25ll . 
it was established that cooperation even with an untrusted relay node could be beneficial 
in relay channels with orthogonal components. Then in ll2~6ll . the authors considered a two- 
hop communication system using an untrusted relay and showed that a cooperative jammer 
enables a positive secrecy capacity which would be otherwise impossible. 

Two-way communication is a common scenario where two terminals transmit information 
to each other simultaneously. Recently, the two-way relay channel ll2~7Tl - |[3Tll has attracted 
lots of interest from both academic and industrial communities due to its advantage in saving 
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bandwidth efficiently. In [27] and 11281 . both AF and DF protocols for one-way relay channels 
were extended to general full-duplex discrete two-way relay channel and half-duplex Gaussian 
two-way relay channel, respectively. In [29|, different relay strategies consisting of AF, DF, 
and EF (estimate-and-forword) for uncoded two-way relay channels were investigated. In 11301 . 
analogue network coding based two-way relay channel with linear processing was analyzed 
and an optimal relay beamforming structure was presented. In OT1 . a joint network-channel 
coding was proposed for the two-way relay channel, where channel codes are used at both 
the sources and a network code is used at the intermediate relay. Although two-way relay 
networks have received so much attention so far, the security issue about the relay, especially 
from the physical layer security point of view, has not been well investigated. 

To improve the security in two-way relay channel, distributed protocols are desired. Game 
theory 021 offers a formal analytical framework with a set of mathematical tools to study 
the complex interactions among interdependent rational players. Recently, there has been 
significant growth in research activities that use game theory for analyzing communica- 
tion networks, mainly due to the need for developing autonomous, distributed, and flexible 
mobile networks where the network devices can make independent and rational strategic 
decisions, as well as the need for low complexity distributed algorithms for competitive or 
collaborative scenarios 03~1 . In 041 and 051 . the authors introduced some recent studies 
on signal processing and communication networks using game theory. In P6l and 071 . the 
authors employed game theory to physical layer security to study the interaction between the 
source and the jammers who assist the source by distracting the eavesdropper, and got some 
distributed game solutions. For two-way relay networks, it is desirable to study the physical 
layer security problems with the aid of game theory similar to those for one-way relay cases. 

In this paper, we investigate physical layer security issues in a two-way relay network with 
friendly jammers. The two sources can exchange information only through an untrusted relay, 
as there is no direct communication link between them. The untrusted AF relay acts as both 
an essential relay and a malicious eavesdropper that has the incentive to eavesdrop on the 
information transmission. For convenience and ease of comparison, we first study the system 
without friendly jammers as a special case. We find that a non-zero secrecy rate here is 
indeed available even without the help of friendly jammers. We also derive an optimal power 
vector of the relay and the sources by maximizing the secrecy rate. We further investigate 
the two-way relay secure communication with friendly jammers, and find that a positive gain 
can be obtained in the secrecy rate by utilizing proper jamming power from the friendly 
jammers. Then the problem comes to how to effectively utilize the jamming power from 
different friendly jammers to maximize the secrecy rate. Thus, we propose a Stackelberg 
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game model between the sources and the friendly jammers as a power control scheme. In the 
defined game, the sources pay the friendly jammers for interfering the untrusted relay in order 
to increase the secrecy rate, while the friendly jammers charge the sources with a certain 
price for their service of jamming. In addition, the optimal solutions of the jamming power 
and the asking prices are given and a distributed updating algorithm to obtain the Stakelberg 
equilibrium is provided for the proposed game. Furthermore, a centralized scheme is also 
proposed for comparison with the distributed Stackelberg game based scheme. Finally, the 
proposed approaches and solutions are verified by simulations. 

The rest of this paper is organized as follows. In Section II, the system model of two-way 
relay communication with friendly jammers is described and the corresponding secrecy rate 
is formulated. In Section III, a two-way relay system without jammers as a special case is 
investigated. In Section IV, we define a Stackelberg type of buyer/seller game to investigate 
the interaction between the sources and the friendly jammers, and analyze the optimization 
problem of physical layer security in the presence of friendly jammers. Simulation results 
are provided in Section V, and the conclusions are drawn in Section VI. 

II. System Model 

As shown in Fig. [H we consider a basic two-way relay network consisting of two source 
nodes, one untrusted relay node, and N friendly jammer nodes, which are denoted by Sk, k = 
1, 2, R, and Jj, i = 1, 2, . . . , N, respectively. We denote by J\f the set of indices {1,2,..., N}. 
All the nodes here are equipped with only a single omni-directional antenna and operate in 
a half-duplex way, i.e., each node cannot receive and transmit simultaneously. Then the 
complete transmission can be divided into two phases. During the first phase, shown with 
solid lines, both source nodes transmit their information to the relay node. Simultaneously, the 
friendly jammers also transmit the jamming signals in order to distract the malicious relay. In 
the second phase, shown with dashed lines, the relay node broadcasts a combined version of 
the received signals to both source nodes. Note that in the system we investigate there is only 
one intermediate relay and we assume that no direct link exists between the two sources. 
Thus, the sole untrusted relay is necessary for the two-way relaying data transmission. A 
key assumption Q we make here is that the sources have perfect knowledge of the jamming 

'We can guarantee this by using some pseudo-random codes which are known to both the friendly jammers and the 
sources but not open to the malicious relay. Beyond this, we can also use some cryptographic signals at the friendly 
jammers for jamming, where the decryption book is a secret key only open to the sources. Then the sources can have a 
perfect knowledge of the jamming signals if each jammer sends some additional bits consisting of the information of the 
jamming signal transmitted (e.g., which code or which encryption method to be used). The information that needs to be 
sent is for one time, which will lead to trivial bandwidth cost. 
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signals transmitted by the friendly jammers, for they have paid for the service. 

Let s\, s 2 , and sf, i G J\f, denote the signal to be transmitted by the source Si, S 2 , and the 
jammers J t , i G N , respectively. Suppose source nodes Si and S 2 transmit with power pi 
and p 2 , and the channel gains from the source nodes to the relay node are denoted by hs k ,R, 
k = 1,2. Each friendly jammer node i transmits with power p{, and the channel gain from 
it to the relay node is denoted by hj^R, i G M. The channel gain contains the path loss and 
the Rayleigh fading coefficient with zero mean and unit variance. For simplicity, we assume 
that the fading coefficients are constant over one frame, and vary independently from one 
frame to another. 

In phase 1, the received signal at the malicious relay can be expressed as 

y r = n r + y/plsih SuR + y/pls 2 hs 2 ,R + ^ \fpi s i h Jr,R' 0) 

i 

where n r denotes the thermal noise at the relay node R, which is a zero mean Gaussian 
random variable with two sided power spectral density of a 2 , i.e., CJ\f(0,a 2 ). Furthermore, 
we assume that the noises at Si, S 2 , and R are independent and identically distributed (i.i.d.). 

In phase 2, the malicious relay, which works in AF mode, amplifies the received signal 
y r by a factor (3 and then broadcasts the signal to both Si and S 2 with power p r . The power 
normalization factor (3 at the relay node can be written as 

P = Li \h Sl , R \ 2 +P2 \h S2 , R \ 2 + Y^Pi \ h Ji,R\ 2 + ■ ( 2 ) 

Then the corresponding signal received by the source Si , denoted by y l9 can be written as 

Vi = Py/jbhsuRVr + ni = + v v s 2 + } J fJ>i,jSj + ui, (3) 

i 

where & = f3y/p r Pih 2 SliR , v x = P y /p r P2hs 1 ,Rh S2 ,R, = Py^pih JitR h Sl ,R, and u x = 
ft 'y/p~^h SltR n r + ni. Similarly, the signal received by the source S 2 , denoted by y 2 , can be 
written as 

2/2 = PVp^h S2:R y r + n 2 = £ 2 Si + v 2 s 2 + ^ ^,i s i + u 2, (4) 

i 

where £ 2 = Py/PrPih SliR h S2 , R , v 2 = (3y/p r P2h 2 S2iR , n 2 ,i = Py/p r Pihj iiR h S2jR , and u 2 = 
fisJp~ r hs 2 ,Rn r + n 2 . 

Assuming that both the source nodes and the jammer nodes are independent, from (Q~|), 
in phase 1, using the matched filter (MF) |f[ the untrusted relay node has the capacity with 



2 For simplicity, we use the matched filter for signal detection | 38| while many other advanced detectors can be applied 
and the analysis can be done in a similar way. 



6 



respect to Si and S 2 as 



cr = fiog i+ ^ "^fL , i 

2 I ff 2 + p 2 £/5 2 ,i? + E Pi9Ji,R 



and 



W i d , P29s 2 ,R , 
°2 = lo g 1 + -5-; 1 ^ j . (6) 

where represents the channel bandwidth, g SuR = \h SltR \ 2 , g S2:R = \h S2 , R \ 2 , and g JuR = 

\h Ji!R \ 2 , i e M. 

In phase 2, at Si, as s 1 as well as sf is known to the source node, and thus we have 

yi = vis 2 + wi. (7) 

Then, the corresponding SNR for the transmission from S 2 to Si, denoted by 72, can be 
expressed as 

72 = N' = (8) 

where JT 2 = - ^ pigSl ^J 2 ^ S2 '- R+cr \ Similarly, at 6*2, the received signal with s 2 and sj removed 
can be written as 



m = 6*1 + w 2- (9) 

The corresponding SNR for the transmission from Si to S 2 , denoted by 71, can be expressed 



as 

= l^l 2 = Plgft.fl (1Q) 

Var{u 2 } a 2 + K 1 + r^M p f' 

where ^ = ^(Pl gSl ,*+ P2gS2 , fl + ^ 

Capacities of two-way relay channel between the two sources are denoted by G\ and C 2 , 
and we have 

Ci = yl0g(l + 7i), (11) 

and 

C 2 = ylog(l + 72 ). (12) 
Then, the secrecy rate for Si and S 2 (H can be defined as 

a = (d - cr) + , (13) 
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and 



CJ = (C 2 - c 2 m ) 



(14) 



where (x) + represents max{x,0}. According to G51l and (261, we have that the defined 
secrecy rate is achievable in a two-hop secure communication with an untrusted relay. 

As a special case, if the jammers are not used, the jammers' transmit power pj should be 
set to zero, Vi G M. Then from the derivation above, we can get the corresponding secrecy 
rate in this case as 



In the system we investigate, there is only one intermediate relay, thus this sole relay is 
necessary in our assumption for two-way relaying data transmission. Actually, the untrusted 
relay has the incentive to forward the signals from both the sources since it can eavesdrop on 
the information transmission through this kind of cooperation. If the relay is non-cooperative 
that it only receives but not relays the information, then the problem comes to deny-of-service 
attack. However, this can be easily detected by the sources, then the non-cooperative relay 
will be treated as a thorough eavesdropper and lose the good opportunity to eavesdrop on 
the information transmission. The sources will then turn to another intermediate relay for 
help to relay their information in a practical scenario where there exist multiple intermediate 
relays. In this paper, we focus on the studies how to prevent the untrusted but necessary 
intermediate relay from eavesdropping the information, and thus, for simplicity and without 
loss of generality, we assume that there is only one necessary intermediate relay in the system 
and the relay is cooperative. 

III. Secrecy Rate of Two- Way Relay Channel Without Jammers 

For comparison and consistence, we first investigate the special case without the presence 
of jammers in this section. We prove that there indeed exists a positive secrecy rate for the 
two-way relay channel even without the help of friendly jammers distracting the malicious 
relay. Furthermore, we also obtain an optimal power allocation of the sources and the relay 
to maximize the secrecy rate. In the next section, we will compare the case with friendly 
jammers with this case to expect a positive performance gain in the secrecy rate. 




(15) 



and 




(16) 



s 



A. Existence of Non-zero Secrecy Rate 

When the eavesdropper channels from the two sources to the malicious relay are degraded 
versions of the equivalent main two-way relay channel between Si and £2, the two sources can 
exchange perfectly secure messages at a non-zero rate. Firstly, we consider the transmission 
from Si to S2. In phase 1, the malicious relay receives the signal si from Si, which consists of 
the information for S , 2 . Meanwhile, S2 also transmits the signal s 2 at the relay, which acts as 
both the information carrier for Si and a jamming signal that makes the eavesdropper channel 
from Si to the malicious relay getting worse. In phase 2, the combined signal consisting of 
si and s 2 arrives at S , 2 . As S% has a perfect knowledge of its own signal s 2 , the signal that 
jammed the malicious relay in phase 1 has no such an effect on S 2 . Therefore, it makes 
possible that the eavesdropper channel is worse than the data transmission channel from Si 
to S2, which means that a non-zero rate for secure communication from Si to S2 is available. 
It is the same situation in the transmission from S 2 to Si. From (PT5T) . (PT6l) and the expressions 
of Ki in (flOl) and K 2 in ©, we can write the probability of the existence of a non-zero 
secrecy rate as 



where T = {pig Su R + Pigs 2 ,R + v 2 )<J 2 . 

Considering the power constraints pi < p max , p 2 < p max , and p r < p max , we can get that 
there exists at least one pair of (p r ,pi,p 2 ) that satisfies p r > max < — % — , — % — L under 



the channel condition of 9S ^ R+9S ^ R < £»«£. Therefore, we have P [Cf >0,CS >0) > at 
some power vectors of (p r ,pi,p 2 ), which actually indicates that a non-zero secrecy rate in 
the two-way relay channel is indeed available. 

B. Maximizing the Secrecy Rate 

In this subsection, we try to get an optimal power vector of (p r ,Pi,P2) which maximizes 
the secrecy rate of the two-way relay channel. We can formulate the problem subject to the 
individual secrecy rate constraints and power constraints as 




(17) 




2 



max C s = max 




(18) 



k=l 




max 
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From (fTTT) . we have that 



T T 

C\ > 0, C s 2 > <=> p r > max <J = — , = — J> . (19) 

P29 2 S2 , R Pi9 2 Si ,r 



From (IT51> . (fT6l) . and (fT8l) . we can get that 

W 



C s = Y (\ogF(p r ,p uP2 ^ , (20) 
where 



( i + EH^r) ( ] 

F(Pr,p„ P 2) ± -, y * +Ki (\ N . (2D 



| I P29S 2 ,R 
1 CT 2 + ^- 2 



^ , Plg^fl 1(14- P2gS 2-R 



As F(p r ,pi,p 2 ) has the same monotonic property as C s under the conditions of (TT8T) . we 
can transform the optimization problem as 

max F(p r ,p 1 ,p 2 ) , (22) 



s.t. 



p r > max 



r t 



Pl<P max j P2<P max i 



mitt 



It can be calculated that dF ( p ^ 1 ' P2 ') > is always established under the conditions of (1221) . 
which implies that F (p r ,p 1 ,p 2 ) is a monotonically increasing function of p r . Therefore, 
when maximizing the secrecy rate C s , p r _ op t = Pmax, where p r _ op t denotes the optimal relay 
power [^|. As a result, the problem can be further transformed into max F (p max ,pi,p 2 ). 

The optimal solutions of p\ and p 2 when maximizing the secrecy rate can be easily obtained 
under different conditions (i.e., g SuR > g S2 , R , gs u R < 9s 2 ,R, an d gs lt R = 9s 2 ,r) through the 
Lagrangian method by solving the Karush-Kuhn-Tucker (KKT) conditions H3l . In this paper, 
subject to the space limit, we omit the detailed computing process and only give the results 
of the optimal solutions of p\ and P2 as: 

1) For the case that gs lt R > 9s 2 ,Ri ^ yields that p2_ op t = Pmax- Meanwhile, if there exists 
a solution p\ e {0,p m ax} that satisfies the equation 9f (p™*>pi>p™">) = o, then we have 
Pi_ op t = Pi- Otherwise, we have px_ opt = p ma x- Here p^t and p 2 _ op t denote the optimal 
power transmitted by Si and S2, respectively. 

2) For the case that gs 1>R < gs 2 ,R> it yields that pi_ opt = p ma x- Meanwhile, if there exists 
a solution p* 6 (0,p m ax] that satisfies the equation 8F(Pma g ma3: ' P2) = 0, then we have 
P2_o P t = P* 2 - Otherwise, we have p 2 _ opt = p max . 

3) For the case that g Slt R = gs 2 ,R, we have that pi_ opt = p max , and p 2 _ op t = Pmax- 



3 Note that here we calculate the optimal power solution of p r only from a mathematical perspective to maximize the 
secrecy rate. In fact, the intermediate relay has no incentive to transmit with the maximum power. 
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In addition, we verify these optimal solutions by simulations in different cases which is 
shown in Fig. [2] and Fig. [3] in Section- V, and after further calculation based on the simulation 
results, we can get that the results agree with the optimal power solutions well. 



In this section, through further analysis, we first find that the secrecy rate of the sources can 
be effectively improved by utilizing proper jamming power from the friendly jammers. Then, 
the problem comes to how to control the jamming power from different friendly jammers 
when optimizing the secrecy rate of the sources. In general, in a cooperative wireless network 
with selfish nodes, nodes may not serve a common goal or belong to a single authority. Thus, 
a mechanism of reimbursement to the friendly jammers should be employed such that the 
friendly jammers can earn benefits from spending their own transmitting power in helping the 
sources for secure data transmission. For the source side, the sources aim to achieve the best 
performance of secrecy rate with the friendly jammers' help with the least reimbursements 
to them. For the friendly jammer side, each friendly jammer aims to earn the payment not 
only covers its transmitting cost but also gains as many extra profits as possible. Therefore, 
we employ a Stackelberg game model [32J as a power control scheme jointly considering 
both the benefits of the sources and the friendly jammers. In the Stackelberg game model we 
proposed, the two sources as a unity is the sole buyer that starts the process of the proposed 
Stackelberg game, and the friendly jammers are the sellers, therefore, the sources are treated 
as leader while the friendly jammers are the followers. Furthermore, the optimal solutions of 
the jamming power and asking price are investigated and a corresponding distributed updating 
algorithm is provided. Finally, a centralized scheme is proposed for performance comparison. 

A. Secrecy Rate Improvement using Friendly Jammers 
From (PT3l and ([14l) . we have 



IV. Physical Layer Security with Friendly Jammers 



r / 

W 

C{ = - log 1 + 

I V 



a 2 + K x + £ 



Pl9S u R 



LI rp<> 

p r g.s 2 ,R^ 1 J 



\ 




(23) 
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and 



C S 



w 
Y 



log 



1 + 



P29S2,R 



n + 



-log 1 + 



P29S2,R 



V 2 +Pl9S 1 ,R + J29Ji,RPi 



(24) 



From (1231) and (|24l) . we can see that both C fe and C™, = 1, 2, are decreasing and convex 
functions of jamming power pf, i e J\f. However, if C™ decreases faster than as the 
jamming power pf increases, Cf. might increase in some region of value pf. But when pf 
further increases, both Ck and C™ will approach zero. As a result, C| approaches zero. 

2 2 

Compared to (QJ) and (Ql), we can get that if ^g^f < g JuR and £J£^ < g JitR , Vz G A/", 



p r gs 1 ,R 



i.e., 2- < min ^^r}, the gain of the secrecy rate will be above zero in some region 

of the jamming power pf. Then the problem comes to how to utilize the jamming power 
from different friendly jammers effectively to maximize the secrecy rate. Thus, we propose 
a Stackelberg game model to achieve effective jamming power control in the following 
subsections. 

Note that synchronization among the sources and the friendly jammers is important in 
the investigated system with friendly jammers. Many works have been devoted to the syn- 
chronization issues among distributed nodes in cooperative networks, for example in ll44l . 
[|45ll . effective synchronization schemes among distributed sensors and cooperative relays 
with low complexity and good performance were proposed. Thus, the synchronization issue 
among the sources and the friendly jammers can be addressed effectively using methods 
similar to those proposed in B4l . B51l . However, this is not the key investigated issue in this 
paper, therefore, we assume that perfect synchronization among the sources and the friendly 
jammers is implemented in the system. 



B. Source Side Game 

We consider the two sources as two buyers who want to optimize their secrecy rates, while 
the cost paid for the "service", i.e., jamming power pf, i e J\f, should also be taken into 
consideration. For the source side we can define the utility function as 

U s = a {C{ + C s 2 ) - M, (25) 

where a is a positive constant representing the economic gain per unit rate of confidential 
data transmission between the two sources, and M is the cost to pay for the friendly jammers. 
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Here we have 

M = Y,m lV l (26) 

where rrii is the price per unit power paid for the friendly jammer i by the sources, i 6 N . 

When considering the optimal transmitting power vector of source S± and S 2 , i.e., (pi,p2) 
to achieve the maximum utility value in (|2~5T) . we can treat the jamming power pj, i G Af, as 
constants since all the nodes transmit with independent power. Thus, we can obtain similar 
results of optimal power solutions as given in Subsection-III-B. But to obtain the optimal 
solutions of pi and p 2 is not our main purpose here. In this subsection, we formulate the 
source side game to study how to effectively utilize the jamming power from different friendly 
jammers in order to achieve the maximum utility value. 

Then the source side game can be expressed as 

max U a = max (a {C[ + C s 2 ) - M) , (27) 
C{ > 0, CJ > 

< pf < p max ,p r = p max , fixed pi, p 2 
The goal of the sources as buyers is to buy the optimal amount of power from the friendly 
jammers in order to maximize the secrecy rate. From (1231) . d24l) . and (|27l) . we have 

/ i_i 1 ij 1 \ 



s.t. 



U s 



aW 



log — \ h log ■ 



1+ a 1 + 



S^/, (28) 



where A x 4 £±*l, A 2 4 £±2l, B x 4 ^ + ^s 2 ,r B a °*+™ s „r t a ^ 

vias 1 ,R P2gs 2 ,R Pi9s lt R P2gs 2 ,R ' PrPigs 2 ,Rgs x ,R 

T a !W L A and L A { e N _ 

PrP29S 2 ,R9Si,R Pl9Sx,R ' P2gs 2 ,R 



By differentiating (|28l) with respect to pf, we have 

dU s aWT x i 



2 Mi + E TijpJ )[l + A 1 + E T hlPl 



+ 



2 ( A 2 + e T 2)i pf J ^1 + A 2 + z T 2M 



77V (29) 
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Rearranging the above equation, when = 0, we can get an eighth order polynomial 
equation as 

(pi) 8 + F^ivi) 1 + Mpft + Mptt + MpIY 

+F ii3 (p{) 3 + F h2 (pf) 2 + Fi&i + F h0 = 0, (30) 
where F i h I = 0, 1, . . . , 7, are formulae of constants A k , B k , T i fc , L i fc , and variables p^, 

k = 1, 2, % e N, j e N but j ^ %, 

The solutions to the high order equation (1301) can be expressed in closed form, but the 
expressions of the solutions are extremely complex and have little necessity for our following 
work. Actually, what to our particular interest are not the closed-form expressions of the 
optimal jamming power, but the parameters that affect these optimal solutions. Thus, the 
optimal jamming power solution can be expressed as 

Pi* = pf (jm, {A k } , {B k } , {T M } , {L k>i } , {pj}^) , (31) 

which is a function of the friendly jammer's price m;, the other jammers' jamming power 
{Pj} jjj> an d other system parameters. Noting that there may be up to eight roots of the 
polynomial equation (1301) . the selected solution should be a real root and can lead to a higher 
value of U s in ([28]) than the other real ones. Subject to the power constraints < pf < p max 
in the game, we can get the optimal strategy as 

Pi_opt = min ( maX {Pi*i 0) ,Pmax) ■ (32) 

If there are no real roots of the equation (1301) . then the optimal strategy will be either p{_ opt = 
or pf opt = Pmax according to which one can achieve a larger U s when other parameters are 
settled. 

Because of the high complexity of the solutions to the high order equation in (IBTI) . we 
further consider a special high interference case to obtain a simple expression of the optimal 
solution. In this special case, we assume that there is one jammer staying very close to the 
malicious relay, so that the interference from the jammer is much stronger than the power of 
the received signals from the sources at the relay. Meanwhile, we also assume that the received 
signal power is much higher than the additive noise, i.e., high signal-to-noise ratio, which 

means a 2 < Pig Sl ,R < p{g.h,R and a 2 < p 2 9s 2 ,R < Pi9j u R- Then ' we have pt£^ < x > 
< 1, < 1, < 1, and < 1. We assume all the left sides of these 

P29S 2 ,R Pi9J 1: -R V'i9.J 1 ,R Pi9Ji,-R 

inequalities which are much smaller than 1 approach zero. Therefore, the utility function of 
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the source side in (1251) can be approximately calculated as 

^ log i + — =-j - log i + -j 

. i A . PrP2gs u R9S 2 ,R\ . A . P2fe,i?\\ J 

+ 1°S 1 + 2~ 7 - log 1 + -J - mi/?! 

V ° 2 Px9Ji,R J \ Px9Jx,r J J 

aW ( PrPigs u R9S2,R Pi9s x ,r . PtP29s 1 ,r9s 2 ,R P29s 2 ,r 



2 V v 2 Pi9Ji,R Pi9ji,r v 2 Pi9JuR Pi9j u r 
aW ( (p r gs 2 ,R -.\Pl9SuR . (Pr9S x ,R ,\P29s 2 ,r\ 1 



2 V V a 2 / 5f Jl>jR V cr 2 7 t/j^jj J p{ 

1 

Pi 



Dx^j-m lP {, (33) 



where £>i 



2 



A _ JVffS^RN PigS lt R _|_ _ Pr-gSi,fl \ P29S 2 ,R 

\ cx 2 / V o- 2 / gj 1>fl 



and the second approximation 



comes from the Taylor series expansion log (1 + x) ~ x when x is small enough Q. It can 
be easily observed that if D\ < 0, U s is a decreasing function of pf. As a result, U s can be 
optimized when pf = 0, i.e., the jammer would not play in the game. If D\ > 0, in order to 
find the optimal power for the sources to buy, we can calculate 

Try = -^-i - mi = 0. (34) 

dpf (p{f 

Hence, the optimal closed-form solution can be expressed as 

j* i Di 



Pi= \/— • (35) 
V mi 

By comparing p{ * with the power under the boundary conditions, we can obtain the optimal 
solution of the source side game for this special case as 

Pl_opt = min (Pi*,Pmax) • (36) 

In Section- V, we employ the general case setups for simulation. The results indicates that 
the sources always prefer to buying power from only one jammer when there exists at least 
one sufficiently-effective jammer, which is more effective to perform jamming than the other 
jammers and is simultaneously asking for a proper price. Therefore, this special case with one 
jammer experiencing severe interference is valid in analyzing the proposed game. Under this 
special case, we can get a property of the proposed game that the optimal power consumption 
p{* is a monotonous function of its price mi, which could help to prove the existence of 
the equilibrium in Subsection-IV-D. We can also prove that the friendly jammer power p{ 
bought from the sources is convex in the price mi under some conditions. 

From (1331) . we have the first order derivative of p{ as 

dvV 1 / — -2 

7 f^ = --/^m 1 2 , (37) 
omi 2 



4 Here we say x is small enough means the high order of x approaches zero. 
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and the second order derivative as 

d 2 p(* 3 



-JD x m x 2 > 0, (38) 



dmi 4 

which indicates in the high interference case, the optimal power p{* is a convex function of 
the price mi. 

C. Friendly Jammer Side Game 

For each friendly jammer, we can define the utility function as 

U i = m i (pf) Ci , (39) 

where q > 1 is a constant to balance the payment rriipf from the sources and the transmission 
cost pf of the jammer itself, i E M. With different values of q, the jammers have different 
strategies for asking the price m;. Here, the jamming power pj is also a function of the 
vector of prices (mi,m 2 , . . . ,mjv), as the amount of jamming power that the sources will 
buy also depends on the prices that the friendly jammers ask. Hence, the friendly jammer 
side game can be expressed as 

max Ui. (40) 

mi 

The goal of each friendly jammer as a seller is to set an optimal price in order to maximize 
its utility. By differentiating the utility in (1391) and setting it to zero, we can get 



UWJ +m i Ci{p ijvt ) ^r-^ = 0, (41) 



which is equivalent to solve 

J / J u Pi_opt 



(pL P r~ 1 (pio Pt + rnJ-^)=0. (42) 



The equation (1421) can be solved by setting either pj opt = or 

dp{ 



Pio P t + m tCl ^= 0. (43) 



Hence, with the optimal solution pj opt , we can get the solution of the optimal price mj_ opt 
as a function given as 



in, 



_ op t = m* {a 2 , g Sl)R , g S2 ,R, {gj^n}} , (44) 



where should be positive; otherwise, the friendly jammer would not participate the 

game, i e Af. 



16 



D. Stackelberg Equilibrium of the Proposed Game 

In this subsection, we investigate the Stackelberg equilibrium of the proposed game, at 
which neither the sources nor each friendly jammer can further improve its utility by changing 
its own strategy only. From the game definitions of the source side in (T27l) and the friendly 
jammer side in (1401) . we can define the Stackelberg equilibrium as follows: 

Definition 1: p{_ SE and mi_ SE are the Stackelberg equilibrium of the proposed game, if 
when rrii is fixed, 

U s({pIse})= sup E/. ({*>/}), VieJV, (45) 

■ma x ■ v; 

and when pf is fixed, 

Ui {mi_ SE ) = sup Ui (rrii) , Vi G Af. (46) 

From the analysis in the previous two subsections, we have that {p/opt}"^ in (1321) and 
{ m i_o P t}f = i in (l44l) are the optimal solutions of the jamming power needed by the sources 
and the asking prices given by the friendly jammers when solving the utility optimization 
problem in (1271) and (1401) . And thus, we can obtain the property that the pair of {p/ opt } i _ 1 
and {mi_ op t}f =l is the Stackelberg equilibrium of the proposed game. 

We can easily prove this property theoretically in the special high interference case with 
one efficient friendly jammer close to the untrusted relay. In Subsection-IV-B, we have proved 
that in this special case, the optimal jamming power solution p{ opt bought from the efficient 
friendly jammer 1 is monotone decreasing and convex with the asking price m l5 when the 
other friendly jammers' prices are fixed. And in this case, the sources would prefer to buy the 
jamming power only from the efficient friendly jammer. Therefore, we can obtain that there 
exists a unique Stackelberg equilibrium that are just the optimal solutions of the jamming 
power and the asking prices. However, due to the extremely complex closed-form expressions 
of the optimal solutions, for more general cases, the proof in theory is intractable, and thus 
instead, we prove by simulations in Section-V that the proposed game can effectively converge 
to a unique Stackelberg equilibrium, which are the optimal solutions of the jamming power 
and the asking price when maximizing the utilities of the sources and the friendly jammers. 

E. Distributed Updating Algorithm 

In this subsection, we construct a corresponding distributed updating algorithm for the 
proposed game to converge to the Stackelberg equilibrium defined above. By rearranging 
(l43l . we have 

m i = J l (m) = -%i, (47) 

1 dm; 
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where m = [mi, 777,2, • • • , ttijv] . P 4 op < is a function of m, and ij (m) is the price update 
function for friendly jammer i, i E M. The information for updating can be obtained from 
the sources, which is similar to the distributed power allocation ll40l . The distributed algorithm 
can be expressed in a vector form as 

m(t + l) =I(m(t)), (48) 

where I = [h, I 2 , ijv] T , an d the iteration is from time t to time t + 1. 

Furthermore, we can get the convergence of the proposed scheme by proving that the 
update function in (l48l is a standard function iflTTl defined as 

Definition 2: A function I (m) is standard, if for all m > 0, the following properties are 
satisfied 

1) Positivity: I (m) > 0, 

2) Mono tonicity: if m > m', then I (m) > I (m'), or I (m) < I (m'), 

3) Scalability: For all 77 > 1, 77I (m) > I (77m). 

Similarly as the situation in PTTl . we can get that each friendly jammer's price will converge 
to a fixed point, i.e., the Stackelberg equilibrium in our game, from any feasible initial 
price vector m(0). The positivity of the update function is easy to prove. We know that if 
the price m t goes up, the sources would buy less from the friendly jammer i. Therefore, 



drrii 



in (1471) is negative, and we can prove the positivity of the price update function as 



U (m) = > 0, Vz G AT. 

i — opt 

Because of the complexity of the optimal solutions in (1321) and (l44l) . the monotonicity and 
scalability can only be shown in the high interference case. From (1331) and (1471) . we have the 
update function as 



{pi ) v ™i 2m i 

dpf 



J 1 (m) = -^-^ = ^_^ = ^, (49) 



which is monotonically increasing with the price m 1 and scalable. 

For more general cases, the analysis is intractable. But in Section- V, we employ the general 
case setups for simulation and the results show that the proposed distributed game scheme 
can converge and outperform the case without jammers. 



F. Centralized Scheme for Comparison 

Traditionally, the centralized scheme is employed assuming that all the channel information 
is known. In this subsection, we formulate the centralized problem by optimizing the secrecy 
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rate with respect to the constraints of maximal jamming power p ma x- 

max C s = max {C{ + C s 2 ) , (50) 

Pi vl 

s.t. 0<pf<p max j 

where C{ and C 2 are obtained from (1231) and ([241) . respectively. The centralized solution is 
found by maximizing the secrecy rate only. 

In Section- V, we compare the proposed distributed algorithm with this centralized scheme. 
From the simulation results, we can see that the distributed solution and the centralized 
solution are asymptotically the same when the unit rate's gain a in (|27l) is sufficiently large. 
However, the distributed solution only needs to update the difference of the power and price 
to be adaptive, while the centralized solution requires all channel information in each time 
slot. Therefore, the distributed algorithm is more efficient in practical applications. 

V. Simulation Results 

To investigate the performances, we conduct the following simulations. For simplicity and 
without loss of generality, we consider a two-way relay system where the sources Si, S2, and 
the malicious relay R are located at the coordinate (—1,0), (1,0), and (0,0), respectively. 
The other simulation parameters are set up as follows: The maximum power constraint p ma x 
is 10; the transmission bandwidth is 1; the noise variance is a 2 = 0.01; Rayleigh fading 
channel is assumed, where the channel gain consists of the path loss and the Rayleigh fading 
coefficient; the path loss factor is 2. Here, we select a = 1 for the source side utility in (1271) . 

For the special case without jammers, we set the jamming power up to zero. In Fig. [2] and 
Fig. [3l we show the secrecy rate as a function of the two sources' transmitting power pi and 
p 2 in this special case. It shows that the optimal power vector of (pi,P2) is (0.22p max ,p max ) 
when g SuR = 0.3857 and g S2 , R = 0.0443, and (p ma x, O.S2p max ) when g SuR = 0.0508 and 
9s 2 ,R — 0.3018. After further calculation, we can see that the results agree with the optimal 
power allocation conclusions given in Section III. 

For the single-jammer case, we consider two jammer locations which are (0.3, 0.4) and 
(0.6,0.8). Fig. H] shows the secrecy rate as a function of the jamming power when pi, p 2 , 
and p r are all set up to p max . We can see that with the increase of the jamming power, the 
secrecy rate first increases and then decreases. There indeed exists an optimal point for the 
jamming power. Also the optimal point depends on the location of the friendly jammer, and 
we can find that the friendly jammer close to the malicious relay is more effective to improve 
the secrecy rate. Fig. \5\ shows that the optimal amount of the jamming power bought by the 
sources depends on the price requested by the friendly jammer. We can see that the amount 
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of bought power gets reduced if the price goes high and the sources would even stop buying 
after some price point. And thus there is a tradeoff for the jammers to set the price. If the 
price is set too high, the sources would buy less power or even stop buying. 

For the multiple-jammer case, we consider two jammers which are located at (0.3, 0.4) 
and (0.5, 0.5), respectively. Note that in the simulations of this scenario the jammers are both 
sufficiently-effective ones. Here we define jammer i as a sufficiently-effective one if it on 
its own can offer a power pf, p{ e (0,p ma x], making the secrecy rate improved up to the 
maximal value. In Fig.[6l Fig.|7J Fig.[H and Fig.|9l the secrecy rate C s , the sources' utility U s , 
the first jammer's utility U\, and the second jammer's utility [7 2 as functions of both jammers' 
prices are shown, respectively. In Fig. [6l we can see that there exists an upper bound and 
a lower bound for the secrecy rate when the channel conditions are settled. When one of 
the two jammers' prices is low enough, the sources could buy sufficient jamming power 
from the jammer to improve the secrecy rate up to the upper bound. When both jammers' 
prices are beyond the sources' payment ability, the secrecy rate would stay at the lower 
bound which is the same as the case without jammers as the sources no longer buy jamming 
power from the jammers. In Fig. [7J we can see that if at least one of the two jammers sets 
a low price, the sources' utility is high as the sources could get a high secrecy rate at a 
low cost from the jammer with low asking price. With both jammers' prices going high, 
the sources' utility decreases. When the prices of both jammers are so high that the sources 
cannot benefit any more from the jamming service at the high cost, the sources would stop 
buying. In Fig. [8] and Fig. [9} we can find that under this condition (i.e., both of the jammers 
are sufficiently-effective ones) the sources would always prefer to buying service from only 
one of the friendly jammers for the best performance. The selected jammer is either more 
effective to improve the secrecy rate when the prices of the jammers are comparable, or the 
one whose asking price is low enough. For the friendly jammer side, if the jammer asks 
too low price, the jammer's utility is very low. But if the jammer asks too high price, the 
sources might buy the service from the other friendly jammer. Therefore, there is an optimal 
price for each friendly jammer to ask, and the sources would always select the one which 
can provide the best performance improvement. 

For the multiple-jammer case, we also discuss how the optimal secrecy rate changes with 
the number of jammers increasing. We conduct simulations under two different scenarios, 
i.e., there exists at least one sufficiently-effective jammer and there is no sufficiently-effective 
jammer. Here no sufficiently-effective jammer means that the sources could not achieve the 
maximal secrecy rate with only one jammer's help. Therefore, the sources have to first sort 
the current friendly jammers in an order of effectiveness, and then buy jamming power from 
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the most effective jammer one by one until the secrecy rate reaches the maximal value. From 
(1231) and (|24l) . we can get that if the channel information and the transmitting power of the 
sources and the relay are settled, the maximal achievable secrecy rate will not change, no 
matter how many jammers are used and how much jamming power bought by the sources 
from each jammer. For the non- sufficiently-effective jammer scenario, we set the jammer i 
located at (x i} yi),i G TV, where xj + yf > 4. In Fig. |T0l the optimal secrecy rate as a function 
of number of friendly jammers is shown. We can see that if there exists sufficiently-effective 
jammers, the optimal secrecy rate does not change as the number of jammers increases. For 
the sources, they always choose the most effective jammer to achieve the maximal secrecy 
rate, when additional jamming power from other jammers would decrease the optimal secrecy 
rate. In other words, in this scenario, the optimal secrecy rate can reach the maximal value 
with the most effective jammer's help only. On the other hand, if there are no sufficiently- 
effective jammers, the optimal secrecy rate will be improved up to the maximal value as the 
number of jammers increases. 

Finally, we compare the distributed solution with the centralized solution of secrecy rate. 
In Fig. [HI we show the optimal secrecy rate of both the distributed and centralized schemes 
as a function of the gain factor a in (T25T) with the friendly jammer located at (1,1). When a 
is large, the sources would care the gain of secrecy rate more than the jamming service cost 
in the game theoretical scheme. We can see that the performance gap between the distributed 
and centralized solutions of optimal secrecy rate is shrinking as a is increasing. And the 
solutions are asymptotically the same when a is sufficiently large. 

VI. Conclusions 

In this paper, we have investigated the physical layer security for two-way relay communi- 
cations with untrusted relay and friendly jammers. As a simple case, a two-way relay system 
without jammers is first studied, and an optimal power allocation vector of the sources and 
relay nodes is found. We then investigated the secrecy rate in the presence of friendly jammers. 
Furthermore, we defined and analyzed a Stackelberg type of game between the sources and 
the friendly jammers to achieve the optimal secrecy rate in a distributed way. Finally, we 
obtained a distributed solution for the proposed game. From the simulation results we can get 
the following conclusions. First, a non-zero secrecy rate for two-way relay channel is indeed 
available. Second, the secrecy rate can be improved with the help of friendly jammers, and 
there is an optimal solution of jamming power allocation. Third, there is also a tradeoff for 
the price a jammer sets, and if the price is too high, the sources will turn to buying from 
others. For the game, we can see that the distributed algorithm and the centralized scheme 
have similar performances, especially when the gain factor is sufficiently large. 
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■> Phase 2 

Fig. 1. System model for two-way relay communications with friendly jammers 
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Fig. 2. Secrecy rate vs. pi and p2 for the case without jammers when gs 1: R > gs 2 ,R 
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Secrecy Rate vs. p1 and p2 








Fig. 3. Secrecy rate vs. pi and p2 for the case without jammers when gs lt R < 9s 2 ,R 
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Secrecy rate vs. jamming power 
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Optimal Jamming Power Bought vs. Jammer Price 
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Fig. 5. How much jamming power the sources buy as a function of the jammer price 
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Fig. 6. Secrecy rate C s vs. the prices of both jammers 
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Fig. 7. The sources' utility U s vs. the prices of both jammers 



Jammer 1 Utility U 1 vs. m and m 2 



0.25 




Fig. 8. The jammer l's utility Ui vs. the prices of both jammers 



27 



Jammer 2 Utility U 2 vs. m and m 2 
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Fig. 9. The jammer 2's utility U2 vs. the prices of both jammers 



Secrecy Rate vs. Number of Jammers 
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Fig. 10. Secrecy rate vs. number of jammers 
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Centralized and Distributed Solution of Secrecy Rate vs. the Price Factor a 
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Fig. 11. Centralized and distributed solution of secrecy rate vs. the gain factor 



